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Intracellular bacterial supply of essential amino acids is common among sap-feeding 
insects, thus complementing the scarcity of nitrogenous compounds in plant phloem. 
This is also the role of the two mealybug endosymbiotic systems whose genomes 
have been sequenced. In the nested endosymbiotic system from Planococcus 
citri (Pseudococcinae), " Candidatus Tremblaya princeps" and " Candidatus Moranella 
endobia" cooperate to synthesize essential amino acids, while in Phenacoccus avenae 
(Phenacoccinae) this function is performed by its single endosymbiont "Candidatus 
Tremblaya phenacola." However, little is known regarding the evolution of essential amino 
acid supplementation strategies in other mealybug systems. To address this knowledge 
gap, we screened for the presence of six selected loci involved in essential amino 
acid biosynthesis in five additional mealybug species. We found evidence of ongoing 
complementarity among endosymbionts from insects of subfamily Pseudococcinae, 
as well as horizontal gene transfer affecting endosymbionts from insects of family 
Phenacoccinae, providing a more comprehensive picture of the evolutionary history of 
these endosymbiotic systems. Additionally, we report two diagnostic motifs to help 
identify invasive mealybug species. 

Keywords: mealybugs, endosymbiosis, "Candidatus Tremblaya princeps", "Candidatus Tremblaya phenacola", 
amino acid biosynthesis, horizontal gene transfer 



INTRODUCTION 

The establishment of permanent intracellular symbioses with 
bacteria has played a key role in insect evolution. Endosymbionts 
are located in specialized eukaryote cells (bacteriocytes) and 
complement the insect's limited heterotrophic metabolism with 
metabolic pathways for the biosynthesis of essential amino acids, 
fatty acids and/or vitamins (Baumann, 2005). Population dynam- 
ics imposed by their lifestyle, together with the stability and 
nutritional richness of the intracellular environment, trigger a 
number of genomic changes in the prokaryote symbiont. Such 
evolutionary changes include drastic genome size reduction, due 
to the loss of genes rendered unnecessary for the association (i.e., 
those that are superfluous in a protected and stable intracellular 
niche or whose function can be provided by the host), and an 
increase in AT content in most analyzed cases (Baumann, 2005; 
Moran et al, 2008; Moya et al., 2008; Gil et al, 2010). Eventually, 
if a second bacterium joins the association, both bacteria coevolve 
and the ongoing reductive genome process affects both of them, 
leading to two possible outcomes: either both bacteria become 
essential for the fitness of the association (complementation), or 
one bacterium undergoes an extreme genome degenerative pro- 
cess, which may end up in its extinction, while the remaining 



bacterium continues the reductive process alone (replacement) 
(Moya et al, 2009). 

As in other phloem-feeding insects, mealybugs rely on 
their endosymbionts for the provision of essential amino acids 
(Baumann, 2005). This fact is supported by the recent genome 
sequencing of endosymbionts from two mealybug species belong- 
ing to subfamilies Pseudococcinae and Phenacoccinae (Lopez- 
Madrigal et al., 201 1; McCutcheon and von Dohlen, 201 1; Husnik 
et al., 2013). Mealybugs from these subfamilies present an intri- 
cate variety of endosymbiotic relationships that reflect both 
complementation and replacement events. Phylogenetic studies 
suggest that a betaproteobacterial ancestor of "Ca. Tremblaya" 
infected a mealybug ancestor before the split of subfamilies 
Phenacoccinae and Pseudococcinae (Hardy et al., 2008). Later 
on, except for the Ferrisia and Maconellicoccus clades (where no 
additional endosymbiont has been reported), the ancestor of 
"Ca. Tremblaya princeps" was infected multiple times by differ- 
ent gammaproteobacteria, establishing a diversity of stable nested 
endosymbiotic consortia, with each "Ca. Tremblaya princeps" 
cell containing several cells of the corresponding gammapro- 
teobacterium (Thao et al, 2002; Gatehouse et al., 2012). By 
contrast, "Ca. Tremblaya phenacola" remained alone in subfamily 
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Phenacoccinae, except in several clades (the tribe Rhizoecini 
and the genus Rastrococcus) where it was replaced by different 
Bacteroidetes (Gruwell et al, 2010; Husnik et al., 2013). 

The most widely studied mealybug endosymbiotic system 
belongs to Planococcus citri (Risso), where "Ca. Tremblaya prin- 
ceps" harbors "Ca. Moranella endobia" (McCutcheon and von 
Dohlen, 2011), with a tight relationship between the nested 
endosymbiosis dynamics and the insect life-cycle (von Dohlen 
et al., 2001; Kono et al, 2008). Independent genomic studies on 
the endosymbiotic consortium of two P. citri strains (PCIT and 
PCVAL) revealed their entangled metabolic complementation for 
the biosynthesis of some essential amino acids (McCutcheon and 
von Dohlen, 2011; Lopez-Madrigal et al., 2013), and necessary 
participation of the insect host (Husnik et al., 2013). Husnik 
and coworkers also reported horizontal gene transfer (HGT) of 
some genes involved in these pathways from diverse bacteria to 
the insect nuclear genome. Even though "Ca. Tremblaya prin- 
ceps" from P. citri exhibits one of the smallest prokaryote genomes 
known so far (139 kb), with an extremely reduced gene set, it 
is the only source for at least 29 enzymes needed to synthe- 
size several essential amino acids (Lopez-Madrigal et al., 2011; 
McCutcheon and von Dohlen, 2011). Functional redundancy 
in both endosymbiotic partners was observed only for dapA 
(involved in lysine biosynthesis) and aroK (involved in pheny- 
lalanine and tryptophan biosynthesis), an indication that each 
bacterium has adopted a specific role in essential amino acid pro- 
vision. Recent sequencing of "Ca. Tremblaya phenacola" PAVE, 
the sole endosymbiont of Phenacoccus avenae (Borchsenius), 
revealed a remarkable case of evolutionary convergence, since it 
has preserved exactly the same set of genes, collectively retained by 
"Ca. Tremblaya princeps" and "Ca. Moranella endobia" in P. citri, 
for supplying essential amino acids to the host (Husnik et al., 
2013). 

In order to study the evolution of essential amino acids 
provisioning within unexplored "Ca. Tremblaya" lineages, we 
have performed a genetic screening on several mealybug 
species, including members of both subfamily Pseudococcinae 
(Dysmicoccus boninsis Kuwana, Pseudococcus viburni Signoret 
and Pseudococcus longispinus Targioni-Tozzetti) and subfamily 
Phenacoccinae (Phenacoccus peruvianus Granara de Willink and 
Phenacoccus madeirensis Green). We searched for the presence 
of genes argH, ilvD, leuB, metE, thrC, and trpB, encoding the 
enzymes that participate in the last steps performed by the 
P. citri and P. avenae endosymbiotic systems in the pathways for 
the biosynthesis of arginine, branched amino acids, methionine, 
threonine and tryptophan, respectively. We have also charac- 
terized the endosymbionts present in these mealybug species 
at the molecular and phylogenetic level. Our results reveal dif- 
ferences among different clades, both at the molecular and 
functional levels, thus providing a more complete picture on 
the complex evolutionary history of the two "Ca. Tremblaya" 
lineages. 

MATERIALS AND METHODS 

INSECT SAMPLE COLLECTION AND DNA EXTRACTION 

Insects belonging to the species D. boninsis, P. viburni, P. longispi- 
nus, P. peruvianus, and P. madeirensis were field collected in 



Valencia (Spain) and stored in absolute ethanol at — 20°C. 
Total DNA (jDNA) extractions were performed with JETFLEX 
Genomic DNA Purification Kit (GENOMED) on 5-8 adult 
females. 

DNA AMPLIFICATION, SEQUENCING AND ANALYSIS 

PCR amplifications were performed on insect jDNA, with 
appropriate primer pairs (see Section Gene Screening), using 
50-60 (i.mol of each primer/50 |il reaction, with KAPATaq DNA 
Polymerase Kit (Kapa Biosystems). The thermal cycling protocol 
was as follows: an initial denaturation at 95° C for 5 min, followed 
by 35 cycles of 50 s at 95°C, 40 s at 56°C (or 52°C when indi- 
cated), and 2 min at 72° C, plus a final extension step of 7 min at 
72° C. When needed, amplicons were cloned using pGEM-T Easy 
Vector System I Kit (Promega). ABI sequencing was performed, 
using specific or vector primers T7 and SP6, at the sequenc- 
ing facility of the Universitat de Valencia. Sequencing reads were 
quality surveyed and assembled with Staden Package (http:// 
staden.sourceforge.net/; Staden et al., 2000). Artemis software 
was used for sequence data management (http://www.sanger.ac. 
uk/resources/software/artemis/; Rutherford et al., 2000). Multiple 
alignments were performed with ClustalW (Larkin et al, 2007). 
MEGA5 was used for the calculation of both p-distance and 
nucleotide composition. 

GENE SCREENING 

Complete sequences of argH (encoding argininosuccinate lyase, 
EC 4.3.2.1, involved in arginine biosyntesis), ilvD (encoding 
dihydroxy-acid dehydratase, EC 4.2.1.9, involved in isoleucine 
and valine biosynthesis), leuB (encoding 3-isopropylmalate dehy- 
drogenase, EC 1.1.1.85, involved in leucine biosynthesis), metE 
(encoding cobalamin-independent homocysteine transmethy- 
lase, EC 2.1.1.14, involved in methionine biosynthesis), thrC 
(encoding threonine synthase, EC 4.2.3.1, involved in threonine 
biosynthesis), and trpB (encoding the beta subunit of trypto- 
phan synthase, EC 4.2.1.20, involved in tryptophan biosynthe- 
sis), were retrieved from GenBank for a set of selected beta 
and gammaproteobacteria (Table SI in Supplementary Material). 
Multiple alignments were performed in ClustalW to allow the 
design of degenerate primers (Table 1) to amplify the corre- 
sponding gene by PCR, using both 52° C and 56° C as annealing 
temperature (T a ) for all primer pairs. jDNA from P. citri was used 
as a positive control. At least 10 clones for each amplicon were 
sequenced. BLAST searches against the non-redundant protein 
database (http://blast.ncbi.nlm.nih.gov/Blast.cgi/; Altschul et al., 
1997) were performed in order to identify the putative taxonomic 
origin of the sequences obtained. 

xDNA from D. boninsis, P. longispinus, P. peruvianus, and 
P. madeirensis were used as templates for PCR amplification 
using 16S rDNA universal primers (Table S2 in Supplementary 
Material; van Ham et al., 1997). The amplicons were cloned as 
above mentioned, and at least 25 clones were sequenced for each 
species. The newly found sequences have been deposited in the 
GenBank database (see Table S3 in Supplementary Material). 
Samples from P. peruvianus and P. madeirensis were also analyzed 
by PCR using gammaproteobacteria-specific 16S rDNA primers 
(Muhling et al, 2008). 
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Table 1 | Degenerate primers used in the gene screening. 



Gene 


Primer 


Sequence (5' -»-3') 


Degeneracy* (%) 


Conserved motif 


argti 


argH-F 


AAYGAYCARRTNGCNACNGA 


35 


NDQ(V/I)ATD 




argH-R 


TC N G G RTTYTTYTTYTG N G G C AT 


26 


MPQKKNPD 


ilvD 


ilvD-F 


ATGTWYACVGCNAAYWCNATG 


33 


M(F/Y)TAN(S/T)M 




ilvD-R 


GARAARAANCKVCCRTCNGT 


35 


SGGT(S/W)G 


leuB 


leuB-F 


GGNGAYGGHATHGGBCCBGA 


30 


GDGIGPE 




leuB-R 


TCVGAVARDATRTCGCCRAA 


30 


FGDILSD 


metE 


metE-F 


AACTAYCACTAYMTVGTVCCNGA 


26 


NYHY(M/I/L)VPE 




metE-R 


CCRCARTCHGGRTTNAYCCA 


30 


W(V/I)NPDCG 


thrC 


thrC-F 


GCDACNTCNGGBGAYACNGG 


30 


ATSGDTG 




thrC-R 


TYNCCRAARTTNCCNSWNGG 


45 


PSGNFG(D/N) 


trpB 


trpB-F 


GTNHTNGGNCARGCNYTNYTNGC 


43 


V(L/I)GQALLA 




trpB-R 


TCNCCNCGNCCNGANAGRTT 


30 


NLSGRGD 



* Primer degeneracy was measured as the proportion of ambiguous sites. 



In order to determine the location of the trpB gene in P. peru- 
viana, we dissected three adult females to separate the head 
from the rest of the body, and extracted jDNA from both sam- 
ples. Universal primers were used to amplify 18S rDNA from 
the host genome (T a = 52°C), as PCR positive control (Table 
S2 in Supplementary Material; Littlewood and Olson, 2001). 
We designed two sets of specific primers to amplify 16S rDNA 
from "Ca. Tremblaya phenacola" and the trpB sequence found 
in P. peruvianus, respectively. All PCR products were sequenced 
using the same primers to confirm their identity. 

A diagnostic screening by restriction enzyme analysis was 
performed on the 16S rRNA genes amplified from P. longispi- 
nus, in order to check for the putative presence of other 
gammaproteobacterial haplotypes previously identified in this 
species (Duron et al., 2008; Rosenblueth et al, 2012). We 
designed a pair of primers in a conserved region of the 16S 
rDNA sequences of the three gammaproteobacterial haplotypes 
and "Ca. Tremblaya princeps" from P. longispinus (Table S2 
in Supplementary Material). RFLP-up and RFLP-down amplify 
the region from sites 516 to 1075 in the E. coli K-12 sub- 
str. MG1655 homolog. PCR products were digested with the 
enzyme Rsa\ (Roche). Restriction digest products were run on an 
agarose gel, stained in ethidium bromide, and visualized with UV 
light. 

FLUORESCENCE IN SITU HYBRIDIZATION (FISH) 

Field collected P. peruvianus adult females were decapitated and 
placed in 4% paraformaldehyde for fixation. Samples were stored 
at 4°C in phosphate buffer saline (PBS) with 0.05% azide until 
preparation for paraffin inclusion. To do so, samples were dehy- 
drated through a graded ethanol series, from 70 to 100% ethanol, 
and finally washed twice in xylene at room temperature for 
30min. Then, they were embedded in paraffin and cut on a 
microtome into 5 u,m thick sections, placed on poly-lysine coated 
slides, air dried, and kept at 4°C. Prior to usage, paraffin sections 



were dewaxed in two xylene baths, followed by two absolute 
ethanol baths of 10 min each. 

In order to permeabilize cellular membranes, samples were 
coated with a few drops of 70% acetic acid while incubated on 
a 60° C hotplate for 1 min. Once rinsed with PBS, they were dehy- 
drated again through a graded ethanol series and air-dried. Slides 
were subsequently coated with 100 u,l of hybridization buffer 
(Tris 20 mM pH 8, NaCl 0.9 M, SDS 0.01% and formamide 
30%) plus 100 ng of the 16S rDNA universal probe Cy5-EUB338 
(Amann et al, 1990) or the specific probe Cy3-TphPPER1290 
( 5' -CCGC AATTCGTACTGAGGTTAGG-3' ) , designed on "Ca. 
Tremblaya phenacola" PPER 16S rRNA gene, and incubated for 
3h at 45° C. To confirm hybridization signal specificity, the fol- 
lowing control experiments were performed: a no-probe control, 
an RNase digestion control (slides treated for 30 min with RNase 
A prior to hybridization), and a competitive suppression control 
with excess unlabeled probe (80ng/|xl of hybridization buffer) 
(Fukatsu et al, 1998). In order to preserve fluorescent signal, 
slides were kept in dark from this point on. After hybridiza- 
tion, they were coated with 1 u,g/mL DAPI for 10 min. Four 
10-min washes were performed with washing buffer (Tris 20 mM 
pH 8, NaCl 112 mM, SDS 0.01% and EDTA 5mM) at 48°C. 
Finally, samples were rinsed twice with milliQ water for 5 min 
at room temperature. Once completely air-dried, slides were 
mounted with Fluoromount-G (Southern Biotech) and kept at 
4°C overnight. 

Slides were observed under an epifluorescence microscope 
(Nikon Eclipse 80i). Nikon DS-QilMc digital camera and NIS- 
Elements BR 3.0 software were used for image capturing and 
processing, respectively. 

PHYL0GENETIC ANALYSES 

Nucleotide sequences used for the phylogenetic analysis were 
retrieved from GenBank or obtained in this work. The 
complete list of sequences, from selected alpha, beta and 



www.frontiersin.org 



August 2014 | Volume 5 | Article 449 | 3 



Lopez-Madrigal et al. 



Analysis of mealybug endosymbiotic lineages 



gammaproteobacteria (including both endosymbiotic and free- 
living species) is presented in Table S3 in Supplementary Material. 

Phylogenetic reconstructions were carried out by Maximun 
Likelihood (ML), Maximum Parsimony (MP) and Bayesian 
methods, in RAxML (Stamatakis, 2006), DNAPARS from 
PHYLIP v3.69 package (Felsenstein, 2005), and MrBayes 3.2 
(Ronquist et al., 2012), respectively. According to JModelTest 
(Guindon and Gascuel, 2003; Darriba et al, 2012), we 
applied a separate general time-reversible evolutionary model 
with gamma- distributed rates and a proportion of invariant 
sites (GTR+I+G) in phylogenetic reconstructions by ML and 
Bayesian methods. In ML and MP reconstructions, bootstrap 
analyses were performed with 1000 replications. In Bayesian 
reconstructions, phylogenetic trees were generated from two runs 
of 200,000 generations for 16S rDNA and two runs of 500,000 
generations for trpB and trpB-argH. Likelihood settings were set 
to nst = 6, rates = invgamma and ngammacat = 4. Sampling was 
performed every 100 generations. First 2300, 3200, and 3500 gen- 
erations were discarded as "burn in" for runs on trpB-argH, 16S 
rDNA and trpB molecular data, respectively. Figures on phyloge- 
netic analysis were prepared with FigTree vl.4.0 software (http:// 
tree.bio.ed.ac.uk/software/figtree/). 

RESULTS 

SCREENING OF GENES INVOLVED IN ESSENTIAL AMINO ACIDS 
BIOSYNTHESIS 

The putative capability for essential amino acid biosynthesis 
among the analyzed species was evaluated by a PCR screening 
of selected genes involved in the last step usually performed by 
the mealybug endosymbiotic systems in the biosynthetic path- 
ways of most essential amino acids. We did not attempt to detect 
genes involved in the biosynthesis of histidine and phenylalanine 



o 



FIGURE 1 | Genetic screening of selected loci involved in essential 
amino acid biosynthesis. The taxonomic assignation of the amplified 
sequences is indicated by black (betaproteobacteria) and red 
(gammaproteobacteria) circles. Empty circles represent pseudogenes. The 
species analyzed in this work appear in bold. P. citri and P. avenae 
endosymbionts are shown for comparison. Cladogram topology represents 



given the impracticality of designing reliable degenerate primers 
on genes hisB and pheA, due to their considerably smaller length 
and/or conservation level. The results of this genetic screening are 
shown in Figure 1. 

argH of gammaproteobacterial origin, according to BLAST 
results, was found in all analyzed Pseudococcinae lineages. 
Although "Ca. Tremblaya princeps" from P. citri also contains 
an argH homolog, it is pseudogenized (Lopez-Madrigal et al., 
2011; McCutcheon and von Dohlen, 2011). Locus degeneration 
is evidenced by two deletions, involving 6 and 57 nucleotides, 
affecting a highly conserved region in analyzed betaproteobac- 
teria and E. coli (gammaproteobacterium), as well as by an 
inactivating frameshift caused by a single cytosine deletion (see 
Supplementary Material). An argH homolog of betaproteobac- 
terial origin was also detected in P. peruvianas, but not in the 
P. madeirensis sample. 

Regarding ilvD, leuB and metE, P. viburni resembles P. citri, 
since they both contain orthologs solely of betaproteobacterial 
origin. In contrast, only an ilvD gene of gammaproteobacterial 
origin was detected in D. boninsis, while no functional homolog 
was detected in P. longispinus. Loci leuB and metE were redundant 
both in D. boninsis and P. longispinus, although the gammapro- 
teobacterial metE homolog in P. longispinus is apparently inac- 
tivated due to a non-sense mutation (TGG— >TAG) affecting a 
highly conserved residue (W140 in the E. coli homolog protein). 
Nevertheless, all the other important residues for protein func- 
tioning examined are still preserved (Table S4 in Supplementary 
Material). MetE key residues identified in E. coli are preserved in 
Burkholderia, the closest free-living betaproteobacterial relative of 
"Ca. Tremblaya," and they are also present in all homolog pro- 
teins of the different "Ca. Tremblaya princeps" strains analyzed. 
Only a non-synonymous He— *-Val change was observed in all of 
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the evolutionary relationships between insect lineages (based on Hardy 
et al., 2008; Gruwell et al., 2010; this work). Marks on the branches 
indicate the infection by the ancestor of "Ca. Tremblaya" (p) and different 
lineages of gammaproteobacteria. y1 corresponds to " Ca. Moranella 
endobia." y2-4 correspond to non-monophyletic gammaproteobacteria, 
based in our phylogenetic analyses. 
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— Dysmicoccus boninsis 
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1 — Pseudococcus longispinus 
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Phenacoccus avenae 

— Phenacoccus peruvianus 
1 — Phenacoccus madeirensis 
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them. Nonetheless, this change is not expected to have functional 
consequences, given the similarities in size and polarity of both 
amino acids. However, this gene was not detected in the two 
Phenococcinae analyzed in this work. 

In the case of thrC, we could not detect this gene in other "Ca. 
Tremblaya princeps" except that from P. citri, while it was identi- 
fied in the two sampled "Ca. Tremblaya phenacola." Finally, only a 
gammaproteobacterial homolog of trpB was found in all analyzed 
mealybugs. 

As expected for mealybugs of subfamily Phenacoccinae, most 
of the amplified genes are of betaproteobacterial origin. The 
genetic screening performed on samples from P. peruvianus and 
P. madeirensis allowed the amplification of five and four out 
of the six screened loci, respectively (Figure 1). This is con- 
sistent with previous descriptions of Phenacoccinae mealybug 
endosymbiotic systems (Husnik et al, 2013; Koga et al., 2013), 
suggesting that "Ca. Tremblaya phenacola" is the only endosym- 
biont in these species too. Unexpectedly, and in contrast with 
its recently described homolog in "Ca. Tremblaya phenacola" 
PAVE (Husnik et al., 2013), the trpB homologs identified in both 
Phenacoccus samples have best similarity hits with gammapro- 
teobacterial proteins. Both sequences are highly similar but not 
identical (p-distance = 0.117), as expected for very closely-related 
orthologs. Additionally, the analysis of their nucleotide composi- 
tion showed an AT-accumulation at codon degenerated positions 
(AT N 3 = 80.2%, contrasting with AT N i = 54.4% and AT N 2 = 
58%), which is common among obligate endosymbionts. Both 



facts appear to discard possible DNA contamination. To confirm 
the gammaproteobacterial origin of such sequences we performed 
a phylogenetic analysis using 771 unambiguously aligned sites 
of the trpB gene from 33 different prokaryote lineages includ- 
ing free-living and endosymbiotic bacteria from classes Beta and 
Gammaproteobacteria (Figure 2). Due to the short length of the 
aligned sequences, the topology, in some cases, does not repro- 
duce the natural clades. Nevertheless, the sequences obtained 
from P. peruvianus and P. madeirensis are clearly located in the 
gammaproteobacterial clade. 

DETERMINATION OF THE LOCATION OF THE trpB GENE IN 
PHENACOCCUS 

The amplification of a trpB gene of gammaproteobacterial ori- 
gin might indicate that a second symbiont is also present in the 
two Phenacoccus species under study. To address this issue, we 
performed a 16S rRNA gene amplification on P. peruvianus and 
P. madeirensis. The PCR products were cloned, and 25 clones 
were sequenced, yielding a single sequence of betaproteobacterial 
origin from each mealybug species (1477 and 1467 bp, respec- 
tively). In order to search for the putative presence of low-density 
gammaproteobacteria in the tested samples, we performed a sec- 
ond PCR screening with gamma-specific primers. xDNA from 
the Pseudococcinae species P. citri, D. boninsis, P. viburni, and 
P. longispinus, where gammaproteobacterial endosymbionts had 
been detected, were used as positive controls. No gammapro- 
teobacteria could be detected on the two Phenacoccus samples 
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FIGURE 2 | ML phylogenetic analysis of the trpB partial nucleotide 
sequences obtained from P. peruvianus and P. madeirensis samples. 

Sequences obtained in this work are in bold; those from the Phenacoccinae 



are underlined. Bayesian and MP analysis gave essentially the same results. 
ML and MP bootstrap values, and Bayesian posterior probabilities over 50% 
are represented. Scale bar represents substitutions per site. 
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FIGURE 3 | PCR screening for gammaproteobacterial endosymbionts. 

Gammaproteobacterial-specific primers were used on jDNA from P. citri 
(lane 2), D. boninsis (lane 3), P. viburni (lane 4), P. longispinus (lane 5), 
P. peruvianus (lane 6), and P. madeirensis (lane 7). The quality of samples 
from P. peruvianus and P madeirensis was tested using 16S rDNA universal 
primers (lanes 9 and 10, respectively). Lanes 8 and 11 are the results of 
negative controls for each pair of primers. 



(Figure 3). The absence of a second bacterium was further con- 
firmed by FISH analysis of P. peruvianus adult females. As it can be 
seen on Figure 4, the bacteriome appears as a well-defined organ 
inside the insect body cavity. It is visible under DAPI staining 
because DNA is found both in the nucleus (insect genome) and 
the cytoplasm (bacterial genomes). The only bacteria detected 
are exclusively located in the bacteriome, and the same fluores- 
cent pattern is observed using both a universal probe and a "Ca. 
Tremblaya phenacola" specific probe. 

To ascertain the location of the amplified trpB gene, either 
in the nuclear or the bacterial genome, we also followed a PCR 
approach (see Materials and Methods; Figure 5). The 18S rRNA 
gene, used as a positive control, was amplified in both head and 
body samples, while no amplification of the 16S rRNA gene was 
obtained from the head sample, thus confirming that it was not 
contaminated with bacteriocytes. As for the 16S rRNA gene, trpB 
was only amplified in the body samples, an indication that it is 
not present in the nuclear genome. Since no other bacterium 
was detected, trpB is likely to be located in the "Cfl. Tremblaya 
phenacola" PPER genome. 

MOLECULAR CHARACTERIZATION AND PHYLOGENETIC ANALYSIS OF 
THE ENDOSYMBIONTS FROM THE ANALYZED MEALYBUGS 

xDNA from D. boninsis, P. longispinus, P. peruvianus, and 
P. madeirensis was used for the amplification of 16S rDNA 
with universal primers, and the corresponding amplicons were 
cloned and sequenced. BLAST analysis of the obtained sequences 
revealed the presence of two different haplotypes, correspond- 
ing to a beta and a gammaproteobacterium, in the two 
Pseudoccocinae species (D. boninsis and P. longispinus), whereas 
the Phenacoccinae (P. peruvianus and P. madeirensis) yielded a 



single haplotype from a betaproteobacterium. We have obtained 
the almost-complete sequence of the 16S rRNA gene (1467 bp) 
of the betaproteobacterium from P. madeirensis, less than 50% 
of which was previously available (Gruwell et al, 2010). The 
sequence of betaproteobacterial origin amplified from P. longispi- 
nus is identical to the one that had been previously identified (Acc. 
no. JN182336). However, the gammaproteobacterial sequence 
obtained in this study (deposited in GenBank under Acc. no. 
KF742536) is not the same as the partial 16S rDNA sequences 
previously reported by Duron et al. (2008) and Rosenblueth et al. 
(2012). In fact, there are 13 polymorphic sites among the three 
haplotypes in the compared region (525 bp). A diagnostic screen- 
ing by restriction enzyme analysis performed on the common 
region of thel6S rRNA genes among "Cfl. Tremblaya princeps" 
and the three gamma-haplotypes from P. longispinus confirmed 
that the sequences previously published were not present in our 
sample (Figure 6). 

The phylogenetic relationship of the endosymbionts under 
study was determined. The analysis was performed on 1221 
unambiguously aligned sites of the 16S rRNA gene from 50 
different prokaryote lineages, including gamma and betapro- 
teobacteria, both free-living and endosymbionts (Figure 7). Two 
separate clades for beta and gammaproteobacteria are well 
defined. Among the beta-endosymbionts, the one from D. bonin- 
sis forms a monophyletic clade with the other "Cfl. Tremblaya 
princeps," while the endosymbionts of both Phenacoccus belong 
to the "Cfl. Tremblaya phenacola" clade. They form a mono- 
phyletic cluster with the endosymbiont from Phenacoccus solani 
(Ferris), and appear separated from the subclade of strain PAVE. 
The obtained tree topology is congruent with that of the hosts 
(Hardy et al., 2008). Regarding the new gamma-endosymbionts 
characterized in this work, those from D. boninsis and P. longispi- 
nus do not cluster together. Nevertheless, these clustering have 
little support. To better define the phylogenetic position of the 
newly described gamma-endosymbionts, we took advantage of 
the argH and trpB genes of gammaproteobacterial origin that 
we detected in the three Pseudococcinae mealybugs analyzed in 
this work (Figure 1). We performed a phylogenetic reconstruc- 
tion using a concatenate of the available sequences from the 
two coding-genes for the same set of gamma-proteobacteria used 
in Figure 7. The analysis include 1308 unambiguously aligned 
sites (489 from argH and 819 from trpB) (Figure 8). This new 
tree confirms that the gamma-endosymbionts of D. boninsis and 
P. longispinus do not belong to the same clade. Furthermore, 
the gamma-endosymbionts of the two Pseudococcus species ana- 
lyzed (P. viburni and P. longispinus) cluster with the Sodalis-Wkt 
endosymbionts, but they do not form a monophyletic group 
either with "Cfl. Moranella endobia" or between them. 

Finally, we performed a molecular characterization of the 
16S rDNA sequences from the analyzed beta-endosymbionts 
from the two Phenacoccus species under study. Strain PPER 
(from P. peruvianus) and PMAD (from P. madeirensis) have a 
GC-content of 48.5 and 48.1%, respectively. This GC-content 
fits well into the range described for the other characterized 
"Cfl. Tremblaya phenacola" strains (from 45.8% in Peliococcus 
turanicus Kiritshenko to 50.6% in Mirococcus sp.), and it is 
clearly lower than the GC-content of the 16S rRNA gene from 
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FIGURE 4 | FISH analysis of P. peruvianas bacteriome. Adult female 
mealybugs stained with DAPI (blue: A,D,E) and probed with Cy5-EUB338 
(green: B,D,E) and Cy3-TphPPER1290 (red: C-E). (A-D) Complete insect 



section showing a compact bacteriome. (E) Amplification of the region 
indicated in the dashed square in (D) to show the endosymbiotic system in 
more detail. Scale: 100 |xm. 
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FIGURE 5 | PCR analysis on T DIMA extracted from P. peruvianas. Heads 


(lines 2, 5, 8 


and bodies (lines 3, 6, 9) were analyzed. Lane 1 , Molecular 


Weight Marker; lines 2-4, amplification of 18S rDNA; lanes 5-7, 


amplification of the 16S rDNA; lanes 8-10, amplification of trpB. Negative 


controls: lanes 4, 7 10. 



"Ca. Tremblaya princeps" (whose known minimum is 55.4% 
in Pseudococcus comstocki Kuwana; Koga et al., 2013). Multiple 
sequence alignment of the 16S rDNA sequences from "Ca. 
Tremblaya phenacola" strains PPER and PMAD revealed that 



both strains present four out of five motifs used by Gruwell et al. 
(2010) to define this species. However, in both cases the motif 
AGTT is modified to AGCT (positions 1240-1243 in the sequence 
from strain PPER). Additionally, the motifs AATGTC and TTTTA 
(sites 160-165 and 1 121-1 125, respectively, in PPER), also present 
in "Ca. Tremblaya phenacola" from P. solani, appear to be unique 
for this subclade members. 

DISCUSSION 

The availability of the complete "Ca. Tremblaya" genomes from 
two mealybug species revealed that "Ca. Tremblaya phenacola" 
PAVE alone is able to provide its host with the same essential 
amino acid biosynthetic capabilities as the consortium composed 
by "Ca. Tremblaya princeps" and "Ca. Moranella endobia" in 
P. citri (Husnik et al., 2013). However, no other mealybug species 
have been thoroughly analyzed for their essential amino acids 
biosynthetic capabilities. To address this issue, we performed 
a genetic screening in five endosymbiotic systems from differ- 
ent subclades of the mealybug subfamilies Pseudococcinae and 
Phenacoccinae. We screened for selected genes involved in the 
last step usually performed by the endosymbionts in the biosyn- 
thetic pathways of arginine (argH), branched amino acids (ilvD 
and leuB), methionine (metE), threonine (thrC), and tryptophan 
(trpB) (Figure 1). Many targeted genes were detected, which is 
consistent with the critical relevance of these bacteria in essential 
amino acid supply. Even though we were unable to obtain some 
amplicons, primer-pair amplification problems seem an improb- 
able cause, because different beta and gammaproteobacteria were 
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FIGURE 6 | Endosymbionts survey in P. fongispinus. Left panel: 

Restriction maps for the four putatively present 16S rDNA sequences, 
identified by their accession numbers in GenBank, i.e., "Ca. Tremblaya 
princeps" (JN182336) and the sequences of gamma-proteobacterial origin 
identified by us (KF742539), Duron et al. (2008) (EU727120), and 
Rosenblueth et al. (2012) (HQ893843). Rsa\ restriction sites and expected 



fragments length are shown. Position numbers refer to the E. colt 
homolog. Right panel: RFLP results. Lanes 1 and 4, Molecular Weight 
Marker; lane 2, undigested amplicon; line 3, amplicon digested with Rsa\. 
A band around 313 bp (labeled with an asterisk) is indicative of the 
presence of the KF742539 sequence (this work). Absence of a band 
around 129 bp rules out the presence of EU727120 and HQ893843. 
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FIGURE 7 | ML phylogenetic analysis of gamma-endosymbionts and 
Tremblaya lineages based on their 1 6S rDNA sequences. Wolbachia pipientis 
Dmel (alphaproteobacterial endosymbiont of Drosophila melanogaster) was 



used as outgroup. Bayesian and MP analysis gave essentially the same 
results. ML and MP bootstrap values, and Bayesian posterior probabilities over 
50% are represented. Scale bar represents substitutions per site. 



successfully amplified in the lineages under study, and no single 
DNA template or primer pair led to complete negative results. 
In any case, apparent absence of some genes in certain lin- 
eages should be interpreted carefully. Negative results do not 
necessarily imply the absence of a certain locus in the endosym- 
biotic system, although they might indicate the absence of a 



functional one. Since the degenerate primers were designed on 
gene regions encoding highly conserved residues, changes affect- 
ing the primer target sequences could affect both gene func- 
tionality and PCR results, potentially preventing the detection of 
pseudogenes. Nevertheless, we were able to amplify some likely 
recent pseudogenes that maintain a high level of identity with 
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FIGURE 8 | Phylogenetic relationships among gamma-endosimbionts 
of mealybugs from subfamily Pseudoccinae, based on concatenated 
sequences of genes argH and trpB. Sequences obtained in this work 
are in bold. The betaproteobacterium Neisseria meningitidis MC58 was 



used as outgroup. Bayesian and MP analysis gave essentially the same 
results. ML and MP bootstrap values, and Bayesian posterior 
probabilities over 50% are represented. Scale bar represents 
substitutions per site. 



predicted functional homologs in closely related species and still 
retain most known critical residues for protein functioning (i.e., 
argH and metE). Functional redundancy tends to be lost fol- 
lowing a stochastic process in endosymbiotic consortia, since 
only one copy is necessary to fulfill host needs. Therefore, the 
detected recent gene inactivation's, as well as the great variety 
in functional redundancies and gene retention patterns among 
endosymbionts of the Pseudococcinae lineages, suggest an ongo- 
ing specification of the role of each endosymbiotic partner in 
metabolic complementation. Moreover, our phylogenetic anal- 
yses (Figures 7, 8) indicate that each lineage was infected with 
different gammaproteobacteria, so that the detected gene losses 
are independent events. Our results indicate that all screened 
genes must have been present in the "Ca. Tremblaya" ances- 
tor. Loss of argH apparently occurred after acquisition of the 
gammaproteobacterial partner in Pseudococcinae. At present, all 
analyzed species have retained an ortholog of gammaproteobac- 
terial origin, but this gene appears to be functional in "Ca. 
Tremblaya princeps" from D. brevipes (Baumann et al., 2002), 
and it is pseudogenized in "Ca. Tremblaya princeps" from P. citri. 
ilvD has been retained by all "Ca. Tremblaya phenacola" analyzed, 
while Pseudococcinae shows various alternatives: in P. citri and 
P. vibumi, it is only present in "Ca. Tremblaya princeps," while 
in D. boninsis the gamma-endosymbiont performs this func- 
tion. leuB has been retained in all "Ca. Tremblaya," whereas it 
is redundant in D. boninsis and P. longispinus. thrC of gamma- 
endosymbiont origin was not detected in any analyzed lineage. 
metE is redundant in D. boninsis and P. longispinus, although 



it is pseudogenized in the gammaproteobacterium of the latter. 
The strict conservation of MetE key residues suggests that "Ca. 
Tremblaya princeps" performs the last step in methionine biosyn- 
thesis from cysteine in all surveyed Pseudococcinae, as well as in 
"Ca. Tremblaya phenacola" PAVE (Husnik et al., 2013). However, 
this gene was not detected in the two Phenococcinae analyzed 
here. The methionine synthase MetH (EC 2.1.1.13) may per- 
form this last step in these endosymbionts, as in "Candidatus 
Hodgkinia cicadicola," endosymbiont of cicadas (McCutcheon 
and Moran, 2010). 

The most intriguing case relates to trpB. Even though 
Phenacoccinae harbor "Ca. Tremblaya phenacola" as a single 
endosymbiont, and in contrast to findings of the genome project 
of strain PAVE (Husnik et al, 2013), both Phenacoccus species 
analyzed here present only a gammaproteobacterial homolog 
(Figure 2). The two sequences we obtained are highly similar 
but not identical, and they present the common AT-content 
bias of P-endosymbionts. Both facts appear to discard DNA 
contamination. However, a PCR screening for gammaproteobac- 
terial endosymbionts gave negative results for both Phenacoccus 
species (Figure 3). FISH and PCR analyses also showed that "Ca. 
Tremblaya phenacola" is the only bacteria found in P. peruvianus, 
where it is confined in the bacteriome (Figures 4, 5). Husnik 
et al. (2013) had recently reported several horizontally transferred 
genes of bacterial origin in the nuclear genome off! citri, some of 
which are involved in the biosynthesis of several nutrients includ- 
ing the amino acid lysine. The source of such genes was not 
any of the members of the mealybug endosymbiotic consortium 
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(i.e., "Ca. Tremblaya princeps" and "Ca. Moranella endobia"), 
even though many of them seem to complement gene loses in 
the consortium genomes. The authors suggest that several facul- 
tative symbionts, which are not essential for host survival, and 
can be free in the environment and infect the host sporadically, 
have been involved in HGT to the insect genome. Altogether, our 
findings suggest that HGT events have also occurred in mealy- 
bugs of the subfamily Phenacoccinae, affecting the evolution of, 
at least, one of the metabolic pathways for essential amino acids 
biosynthesis. In this case, however, the trpB gene appears to have 
been transferred to the "Ca. Tremblaya phenacola" genome. If 
confirmed by sequencing the whole endosymbiont genome, this 
would be (to our knowledge) the second case of HGT described in 
endosymbiotic bacteria. The other described case corresponds to 
"Candidatus Profftella armature," a defensive symbiont from the 
psyllid Diaphorina citri, in which the genes involved in the biosyn- 
thesis of a cytotoxic metabolite appear to have been horizontally 
acquired (Nakabachi et al., 2013). 

The analysis of the obtained 16S rDNA sequences (Figure 7) 
showed the presence of a beta-endosymbiont in all mealybug 
species under study, whereas in the Pseudococcinae there is 
also a gammaproteobacterium, as expected (Thao et al, 2002; 
Hardy et al., 2008; Gruwell et al, 2010). The phylogenetic recon- 
struction using a concatenate of argH and trpB allowed us 
a better characterization of the position of the newly identi- 
fied gamma-endosymbionts (Figure 8). Our analyses show that 
both Pseudococcus gamma-endosymbionts analyzed in this work 
belong to the Sodalis-like clade. However, they do not form a 
monophyletic group either with "Ca. Moranella endobia" or 
between them. On the other hand, the gamma-endosymbiont of 
D. boninsis is not a Sodalis-Yikt bacterium, consistently with the 
gamma-endosymbionts described for other species of the genus 
Dysmicoccus, based on 16S rDNA sequences (Thao et al., 2002). 

The identification of the gamma-endosymbiont of P. longispi- 
nus had been controversial. Some authors were unable to detect 
it (Thao et al, 2002; Gatehouse et al., 2012), while two different 
haplotypes have been identified in other studies (Duron et al., 
2008; Rosenblueth et al., 2012). In all these cases the genetic 
screening was not exhaustive (due to the analysis of a limited 
number of clones), and the amplified sequences were shorter than 
the one obtained in this work. Our results, based on the analysis of 
36 clones, indicate the presence of a single gamma-endosymbiont 
in this species, which is consistent with what had been found 
in other Pseudococcinae. However, our sequence does not corre- 
spond to any of the previously described (Figure 6). The existence 
of different haplotypes might indicate high levels of intraspecific 
polymorphisms in the gamma-endosymbionts of P. longispinus. 
Alternatively, due to the high morphological similarity among 
mealybug species, problems in the identification of insect host 
species cannot be ruled out. 

We have also performed phylogenetic and molecular charac- 
terization of "Ca. Tremblaya phenacola" strains PPER and PMAD. 
Only a few strains have been reported in this species previously, 
and two subclades have been described (Gruwell et al., 2010). 
As revealed by the phylogenetic analysis based on 16S rDNA 
sequences (Figure 7, Table S3 in Supplementary Material), strains 
PPER and PMAD are members of the most unexplored subclade. 



While they present most of the characteristic sequences used by 
Gruwell et al. (2010) to define this species, we also identified 
some unique molecular signatures for this subclade. Considering 
the high morphological similarity exhibited by mealybug species, 
these sequences could be useful as potential targets on strate- 
gies for both bacteria and insect molecular identification (Cox, 
1983; Charles et al, 2000). Specifically, the mealybug species 
P. solani, P. peruvianas, and P. madeirensis, which belong to this 
subclade and are invasive pests of horticultural and ornamental 
plants, represent a relevant threat in several European countries 
(Pellizzari and Germain, 2010). Therefore, the ability to dif- 
ferentiate them, at the molecular level, from other widespread 
polyphagous species such as P. citri, P. viburni, and P. longispi- 
nus, could provide a rapid pests detection tool for import/export 
controls in Europe. 

In summary, our molecular and phylogenetic analyses pro- 
vide a more complete picture of the complex evolutionary history 
of the two "Ca. Tremblaya" lineages. The genetic screening of 
selected genes confirmed the importance of mealybug endosym- 
bionts in providing essential amino acids to their hosts. In 
several Pseudococcinae analyzed, the complementation of "Ca. 
Tremblaya princeps" and the gamma-endosymbionts is ongo- 
ing, given the gene redundancies found. We have also identified 
a putative case of HGT in "Ca. Tremblaya phenacola" for the 
biosynthesis of tryptophan. Finally, from an applied point of 
view, two diagnostic motifs in the 16S rDNA sequence have been 
identified, which could be potentially used to implement a rapid 
detection method to differentiate mealybug pests in horticultural 
and ornamental plants. 

ACKNOWLEDGMENTS 

This work was supported by grants BFU2012-39816- 
C02-01 (co-financed by FEDER funds and Ministerio de 
Economia y Competitividad, Spain) to Amparo Latorre and 
Prometeo/2009/092 (Conselleria d'Educacio, Generalitat 
Valenciana, Spain) to Andres Moya. Sergio Lopez-Madrigal 
and Aleixandre Beltra are recipients of fellowships from the 
Ministerio de Educacion (Spain). 

SUPPLEMENTARY MATERIAL 

The Supplementary Material for this article can be found online 
at: http://www.frontiersin.org/journal/10.3389/fmicb.2014. 
00449/abstract 

REFERENCES 

Altschul, S. E, Madden, T. L. ; Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., et al. 
(1997). Gapped BLAST and PSI-BLAST: a new generation of protein database 
search programs. Nucleic Acids Res. 25, 3389-3402. doi: 10. 1093/nar/25. 17.3389 

Amann, R. I., Binder, B. J., Olson, R. J., Chisholm, S. W., Devereux, R., and Stahl, D. 
A. (1990). Combination of 16S rRNA-targeted oligonucleotide probes with flow 
cytometry for analyzing mixed microbial populations. Appl. Environ. Microbiol. 
56, 1919-1925. 

Baumann, L., Thao, M. L., Hess, J. M., Johnson, M. W., and Baumann, P. (2002). 

The genetic properties of the primary endosymbionts of mealybugs differ 

from those of other endosymbionts of plant sap-sucking insects. Appl. Environ. 

Microbiol. 68, 3198-3205. doi: 10.1128/AEM.68. 7.3198 
Baumann, P. (2005). Biology of bacteriocyte-associated endosymbionts 

of plant sap-sucking insects. Annu. Rev. Microbiol. 59, 155-189. doi: 

10. 1 146/annurev.micro.59.030804. 12 1041 



Frontiers in Microbiology | Microbial Symbioses 



August 2014 | Volume 5 | Article 449 | 10 



Lopez-Madrigal et al. 



Analysis of mealybug endosymbiotic lineages 



Charles, J. G., Froud, K. J., and Henderson, R. C. (2000). Morphological variation 
and mating compatibility within the mealybugs Pseudococcus calceolariae and 
P. similans (Hemiptera: Pseudococcidae), and a new synonymy. Syst. Entotnol. 
25, 285-294. doi: 10.1046/j.l365-3113.2000.00104.x 

Cox, J. M. (1983). An experimental study of morphological variation in mealy- 
bugs (Homoptera: Coccoidea: Pseudococcidae). Syst. Entotnol 8, 361-382. doi: 
10.1 1 1 1/j. 1365-31 13. 1983.tb00489.x 

Darriba, D., Taboada, G. L., Doallo, R., and Posada, D. (2012). jModelTest 2: 
more models, new heuristics and parallel computing. Nat. Methods 9, 772. doi: 
10.1038/nmeth.2109 

Duron, O., Bouchon, D., Boutin, S., Bellamy, L., Zhou, L., Engelstadter, J., et al. 
(2008). The diversity of reproductive parasites among arthropods: Wolbachia 
do not walk alone. BMC Biol. 6:27. doi: 10.1186/1741-7007-6-27 

Felsenstein, J. (2005). Using the quantitative genetic threshold model for infer- 
ences between and within species. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 360, 
1427-1434. doi: 10.1098/rstb.2005.1669 

Fukatsu, T., Watanabe, K., and Sekiguchi, Y. (1998). Specific detection of intracellu- 
lar symbiotic bacteria of aphids by oligonucleotide-probed in situ hybridization. 
Appl Entotnol. Zool. 33, 461-472. 

Gatehouse, L. N., Sutherland, P., Forgie, S. A., Kaji, R., and Christeller, J. 
T. (2012). Molecular and histological characterization of primary (betapro- 
teobacteria) and secondary (gammaproteobacteria) endosymbionts of three 
mealybug species. Appl. Environ. Microbiol. 78, 1187-1197. doi: 10.1128/AEM. 
06340-11 

Gil, R-, Latorre, A., and Moya, A. (2010). "Evolution of prokaryote-animal sym- 
biosis from a genomics perspective," in (Endo)Symbiotic Methanogenic Archaea 
(Microbiology Monographs), ed J. H. P. Hackstein (Berlin; Heidelberg: Springer- 
Verlag), 207-233. 

Gruwell, M. E., Hardy, N. B., Gullan, P. J., and Dittmar, K. (2010). Evolutionary 
relationships among primary endosymbionts of the mealybug subfam- 
ily Phenacoccinae (Hemiptera: Coccoidea: Pseudococcidae). Appl. Environ. 
Microbiol. 76, 7521-7525. doi: 10.1128/AEM.01354-10 

Guindon, S., and Gascuel, O. (2003). A simple, fast, and accurate algorithm to esti- 
mate large phylogenies by maximum likelihood. Syst. Biol. 52, 696-704. doi: 
10.1080/10635150390235520 

Hardy, N. B., Gullan, P. J., and Hodgson, C. J. (2008). A subfamily-level classifi- 
cation of mealybugs (Hemiptera: Pseudococcidae) based on integrated molec- 
ular and morphological data. Syst. Entotnol. 33, 51-71. doi: 10.1 11 1/j. 1365- 
3113.2007.00408.x 

Husnik, E, Nikoh, N., Koga, R., Ross, L., Duncan, R. P., Fujie, M., et al. 
(2013). Horizontal gene transfer from diverse bacteria to an insect genome 
enables a tripartite nested mealybug symbiosis. Cell 153, 1567-1578. doi: 
10.1016/j.cell.2013.05.040 

Koga, R., Nikoh, N., Matsuura, Y., Meng, X. Y, and Fukatsu, T. (2013). Mealybugs 
with distinct endosymbiotic systems living on the same host plant. FEMS 
Microbiol. Ecol. 83, 93-100. doi: 10.1111/j.l574-6941.2012.01450.x 

Kono, M., Koga, R., Shimada, M., and Fukatsu, T. (2008). Infection dynam- 
ics of coexisting beta- and gammaproteobacteria in the nested endosym- 
biotic system of mealybugs. Appl Environ. Microbiol 74, 4175-4184. doi: 
10.1128/AEM.00250-08 

Larkin, M. A., Blackshields, G, Brown, N. P., Chenna, R., McGettigan, P. A., 
McWilliam, H., et al. (2007). Clustal W and Clustal X version 2.0. Bioinformatics 
23, 2947-2948. doi: 10.1093/bioinformatics/btm404 

Littlewood, D. T. J., and Olson, P. D. (2001). "SSU rDNA and the platy- 
helminthes: signal, noise, conflict and compromise," in Interrelationships of the 
Platyhelminthes, eds D. T. J. Littlewood and R. A. Bray (London: Taylor and 
Francis), 262-278. 

Lopez-Madrigal, S., Latorre, A., Porcar, M., Moya, A., and Gil, R. (2011). Complete 
genome sequence of "Candidatus Tremblaya princeps" strain PCVAL, an 
intriguing translational machine below the living-cell status. /. Bacteriol. 193, 
5587-5588. doi: 10.1128/JB.05749-11 

Lopez-Madrigal, S., Latorre, A., Porcar, M., Moya, A., and Gil, R. (2013). Mealybugs 
nested endosymbiosis: going into the 'matryoshka' system in Planococcus citri in 
depth. BMC Microbiol. 13:74. doi: 10.1186/1471-2180-13-74 

McCutcheon, J. P., and Moran, N. A. (2010). Functional convergence in reduced 
genomes of bacterial symbionts spanning 200 My of evolution. Genome Biol. 
Evol. 2, 708-718. doi: 10.1093/gbe/evq055 



McCutcheon, J. P., and von Dohlen, C. D. (2011). An interdependent metabolic 

patchwork in the nested symbiosis of mealybugs. Curr. Biol. 21, 1366-1372. doi: 

10.1016/j.cub.2011.06.051 
Moran, N. A., McCutcheon, J. P., and Nakabachi, A. (2008). Genomics and evo- 
lution of heritable bacterial symbionts. Anna. Rev. Genet. 42, 165-190. doi: 

10.1 146/annurev.genet.41. 110306.130119 
Moya, A., Gil, R., and Latorre, A. (2009). The evolutionary history of symbiotic 

associations among bacteria and their animal hosts: a model. Clin. Microbiol. 

Infect. 15 (Suppl. 1), 11-13. doi: 10.1111/j.l469-0691.2008.02689.x 
Moya, A., Pereto, J., Gil, R., and Latorre, A. (2008). Learning how to live 

together: genomic insights into prokaryote-animal symbioses. Nat. Rev. Genet. 

9, 218-229. doi: 10.1038/nrg2319 
Muhling, M., Woolven-Allen, J., Murrell, J. C, and Joint, I. (2008). Improved 

group-specific PCR primers for denaturing gradient gel electrophoresis analysis 

of the genetic diversity of complex microbial communities. ISME J. 2, 379-392. 

doi: 10.1038/ismej.2007.97 
Nakabachi, A., Ueoka, R., Oshima, K, Teta, R., Mangoni, A., Gurgui, M., et al. 

(2013). Defensive bacteriome symbiont with a drastically reduced genome. 

Curr. Biol. 23, 1478-1484. doi: 10.1016/j.cub.2013.06.027 
Pellizzari, G., and Germain, J. F. (2010). "Scales (Hemiptera, Superfamily 

Coccoidea)," in Alien Terrestrial Arthropods of Europe (Biorisk), eds A. Roques, 

M. Kenis, D. Lees, C. Lopez- Vaamonde, W. Rabitsch, J. Y. Rasplus, and D. B. Roy 

(Sofia: Pensoft Publishers), 475-510. 
Ronquist, F., Teslenko, M., van der Mark, P., Ayres, D. L., Darling, A., Hohna, S-, 

et al. (2012). MrBayes 3.2: efficient Bayesian phylogenetic inference and model 

choice across a large model space. Syst. Biol. 61, 539-542. doi: 10.1093/sys- 

bio/sys029 

Rosenblueth, M., Sayavedra, L., Samano-Sanchez, H., Roth, A., and Martinez- 
Romero, E. (2012). Evolutionary relationships of flavobacterial and enterobacte- 
rial endosymbionts with their scale insect hosts (Hemiptera: Coccoidea). /. Evol 
Biol 25, 2357-2368. doi: 10.1 1 1 1420-9 10 1 .20 1 2.026 1 1.x 

Rutherford, K, Parkhill, J., Crook, J., Horsnell, T, Rice, P., Rajandream, M. A., 
et al. (2000). Artemis: sequence visualization and annotation. Bioinformatics 16, 
944-945. doi: 10.1093/bioinformatics/16.10.944 

Staden, R., Beal, K. E, and Bonfield, J. K. (2000). The Staden package, 1998. 
Methods Mol. Biol. 132, 115-130. doi: 10.1385/1-59259-192-2:115 

Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-based phyloge- 
netic analyses with thousands of taxa and mixed models. Bioinformatics 22, 
2688-2690. doi: 10.1093/bioinformatics/btl446 

Thao, M. L., Gullan, P. J., and Baumann, P. (2002). Secondary (v-proteobacteria) 
endosymbionts infect the primary (fi-proteobacteria) endosymbionts of mealy- 
bugs multiple times and coevolve with their hosts. Appl. Environ. Microbiol 68, 
3190-3197. doi: 10.1128/AEM.68.7.3190 

van Ham, R. C, Moya, A., and Latorre, A. (1997). Putative evolutionary origin 
of plasmids carrying the genes involved in leucine biosynthesis in Buchnera 
aphidicola (endosymbiont of aphids). /. Bacteriol. 179, 4768-4777. 

von Dohlen, C. D., Kohler, S., Alsop, S. T, and McManus, W. R. (2001). Mealybug 
beta-proteobacterial endosymbionts contain gamma-proteobacterial sym- 
bionts. Nature 412, 433-436. doi: 10.1038/35086563 

Conflict of Interest Statement: The authors declare that the research was con- 
ducted in the absence of any commercial or financial relationships that could be 
construed as a potential conflict of interest. 

Received: 06 June 2014; accepted: 06 August 2014; published online: 26 August 2014. 
Citation: Lopez-Madrigal S, Beltra A, Resurreccion S, Soto A, Latorre A, Moya A and 
Gil R (2014) Molecular evidence for ongoing complementarity and horizontal gene 
transfer in endosymbiotic systems of mealybugs. Front. Microbiol. 5:449. doi: 10.3389/ 
fmicb.2014.00449 

This article was submitted to Microbial Symbioses, a section of the journal Frontiers in 
Microbiology. 

Copyright © 2014 Lopez-Madrigal, Beltra, Resurreccion, Soto, Latorre, Moya and 
Gil. This is an open-access article distributed under the terms of the Creative Commons 
Attribution License (CC BY). The use, distribution or reproduction in other forums is 
permitted, provided the original author(s) or licensor are credited and that the original 
publication in this journal is cited, in accordance with accepted academic practice. No 
use, distribution or reproduction is permitted which does not comply with these terms. 



www.frontiersin.org 



August 2014 | Volume 5 | Article 449 | 11 



